Randomized selection revisited 
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Abstract 

We show that several versions of Floyd and Rivest's algorithm Select for finding 
the fcth smallest of n elements require at most n + min{&:, n — k} + o(n) comparisons 
on average and with high probability. This rectifies the analysis of Floyd and Rivest, 
and extends it to the case of nondistinct elements. Our computational results confirm 
that Select may be the best algorithm in practice. 
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1 Introduction 

The selection problem is denned as follows: Given a set X := {xj}™ =1 of n elements, a 
total order < on X, and an integer 1 < k < n, find the kth smallest element of X, i.e., 
an element x of X for which there are at most k — 1 elements Xj < x and at least k 
elements Xj < x. The median of X is the |~n/2]th smallest element of X. (Since we are 
not assuming that the elements are distinct, X may be regarded as a multiset). 

Selection is one of the fundamental problems in computer science. It is used in the 
solution of other basic problems such as sorting and finding convex hulls. Hence its lit- 
erature is too vast to be reviewed here; see, e.g., |[DH UZ01] , poZ99| , poZ01|| and ||Knu98| , 



§5.3.3]. We only stress that most references employ a comparison model (in which a se- 
lection algorithm is charged only for comparisons between pairs of elements), assuming 
that the elements are distinct. Then, in the worst case, selection needs at least (2 + e)n 
comparisons |DoZ01|| , whereas the pioneering algorithm of [ |BFP + 72]j makes at most 5.43n, 
its first improvement of ||SPP76|1 needs 3n + o(n), and the most recent improvement in 
DoZ99|1 takes 2.95n + o(n). Thus a gap of almost 50% still remains between the best lower 



and upper bounds in the worst case. 

The average case is better understood. Specifically, for k < \n/2], at least n + k — 2 
comparisons are necessary ||CuM89|| , [ |Knu98| , Ex. 5.3.3-25], whereas the best upper bound 
is n + k + O^^ln 172 n) [|Knu98| , Eq. (5.3.3.16)]. Yet this bound holds for a hardly 
implementable theoretical scheme ||Knu98| , Ex. 5.3.3-24], whereas a similar frequently cited 
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bound for the algorithm Select of ||FlR75b|| doesn't have a full proof, as noted in ||Knu98 



Ex. 5.3.3-24] and ||PRKT83 
Find of [|Hoa61 



Significantly worse bounds hold for the classical algorithm 
also known as quickselect, which partitions X by using the median of a 
random sample of size s > 1. In particular, for k = \n/2], the upper bound is 3.39n + o(n) 
for s = 1 ||Knu98| , Ex. 5.2.2-32] and 2.75n + o(n) for s = 3 ||Gru99| , |KMP97|| , whereas for 
finding an element of random rank, the average cost is 3n + o(n) for s — 1, 2.5n + o(n) for 

oo [MaROl l. 



s = 3 |KMP97|] , and In + o(n) when s — > oo, s/n — ► as n 



Find is most popular, because the algorithms of |BFP + 72" 
the average [|Mus97| , [ValOO 



In practice 
SPP70I are much slower on 



For the general case of nondistinct elements, little is known 



in theory about these algorithms, but again Find performs well in practice ||VaI00 



Our aim is to rekindle theoretical and practical interest in the algorithm Select of 
(FIR75E 



^2.1] (the versions of 



2.3] and [ [FlR75a| will be addressed elsewhere). 
We show that Select performs very well in both theory and practice, even when equal 
elements occur. To outline our contributions in more detail, we recall that Select operates 
as follows. Using a small random sample, two elements u and v almost sure to be just 
below and above the kt\i are found. The remaining elements are compared with u and v 
to create a small selection problem on the elements between u and v that is quickly solved 
recursively. By taking a random subset as the sample, this approach does well against any 
input ordering, both on average and with high probability. 

First, we revise Select slightly to simplify our analysis. Then, without assuming 
that the elements are distinct, we show that Select needs at most n + min{k,n — k} + 
0(n 2 / 3 ln 1//3 n) comparisons on average; this agrees with the result of ||F1R75U §2.2] which 
is based on an unproven assumption [FRKT53| , §5]. Similar upper bounds are established 
for versions that choose sample sizes as in |[F1R754 |MehOO| , |Rei85|| and [|MoR95| , §3.3]. 
Thus the average costs of these versions reach the lower bounds of 1.5n + o(n) for median 
selection and 1.25n + o(n) for selecting an element of random rank (yet the original sample 
size of [[FlR75b| , §2.2] has the best lower order term in its cost). We also prove that nonre- 



cursive versions of Select, which employ other selection or sorting algorithms for small 
subproblems, require at most n + min{fc, n — k} + o(n) comparisons with high probability 
(e.g., 1 — 4n~ 2fB for a user-specified j3 > 0); this extends and strengthens the results of 
||GeS96| , Thm 2], |[Meh0q , Thm 2] and ||MoR95| , Thm 3.5]. 

Since theoretical bounds alone needn't convince practitioners (who may worry about 
hidden constants, etc.), a serious effort was made to design a competitive implementation 
of Select. Here, as with Find and quicksort [[3ed77| , the partitioning efficiency is crucial. 
In contrast with the observation of |[FlR75b , p. 169] that "partitioning X about both u 
and v [is] an inherently inefficient operation" , we introduce several quintary schemes which 
perform well in practice. As a byproduct, we give a modification of the ternary partitioning 
scheme of |[BeM93| , [BcS971| that obviates subscript range checking. 

Relative to Find, Select requires only small additional stack space for recursion, 
because sampling without replacement can be done in place. Still, it might seem that 
random sampling needs too much time for random number generation. (Hence several 
popular implementations of Find don't sample randomly, assuming that the input file is 
in random order, whereas others | |ValOU| j invoke random sampling only when slow progress 
occurs.) Yet our computational experience shows that sampling doesn't hurt even on 
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random inputs, and it helps a lot on more difficult inputs (in fact our interest in Select 
was sparked by the poor performance of the implementation of ||FlR75a|| on several inputs 
of ||ValO0|| ). Most importantly, even for examples with relatively low comparison costs, 
Select beats quite sophisticated implementations of Find by a wide margin, in both 
comparison counts and computing times. To save space, only selected results are reported, 
but our experience on many other inputs was similar. In particular, empirical estimates 
of the constants hidden in our bounds were always quite small. Further, the performance 
of Select is extremely stable across a variety of inputs, even for small input sizes (cf. 
§ |7.3| ). A theoretical explanation of these features will be undertaken elsewhere. For now, 
our experience supports the claim of ||FlR75b| , §1] that "the algorithm presented here is 
probably the best practical choice" . 

The paper is organized as follows. A general version of Select is introduced in §|2], 
and its basic features are analyzed in §|3[ The average performance of Select is studied 
in §[|. High probability bounds for nonrecursive versions are derived in §|^. Partitioning 
schemes are discussed in §|6|. Finally, our computational results are reported in 

Our notation is fairly standard. \A\ denotes the cardinality of a set A. In a given 
probability space, P is the probability measure, and E is the mean-value operator. 



2 The algorithm Select 

In this section we describe a general version of Select in terms of two auxiliary functions 
s(n) and g(n) (the sample size and rank gap), which will be chosen later. We omit their 
arguments in general, as no confusion can arise. 

Select picks a small random sample S from A and two pivots u and v from S such that 
u < x k <v with high probability, where x* k is the kth smallest element of A. Partitioning 
X into elements less than u, between u and v, greater than v, and equal to u or v, Select 
either detects that u or v equals x* k1 or determines a subset A of X and an integer k such 
that x* k may be selected recursively as the fcth smallest element of X. 

Below is a detailed description of the algorithm. 

Algorithm 2.1. 

Select(A, k) (Selects the kth smallest element of A, with 1 < k < n := |A|) 

Step 1 {Initiation). If n — 1, return x\. Choose the sample size s < n — 1 and gap g > 0. 

Step 2 (Sample selection). Pick randomly a sample S := {yi, . . . ,y s } from A. 

Step 3 (Pivot selection). Set i u := max{ \ks/n — g], 1}, i v : = min{ \ks/n + g] , s}. Let u 
and v be the i u th and z„th smallest elements of S, found by using Select recursively. 

Step 4 (Partitioning). By comparing each element x of A to u and v , partition A into 
A := {x G A : x < u}, B := {x G A : x = u}, C := {x G A : u < x < v}, 
D := {x G A : x = v}, E := {x G A : v < x}. If k < n/2, x is compared to v first, and to 
u only if x < v and u < v. If k > n/2, the order of the comparisons is reversed. 

Step 5 (Stopping test). If \A\ < k < \AUB\ then return u; else if \AUBUC\ < k < n-\E\ 
then return v. 
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Step 6 (Reduction) . If k < \A\, set X := A and k := k; else if n — < k, set X := E 
and /c := A; — n + else set X := C and k := k — |A U 5|. Set n := |X|. 

Step 7 (Recursion). Return Select(X, fc). 

A few remarks on the algorithm are in order. 

Remarks 2.2. (a) The correctness and finiteness of Select stem by induction from the 
following observations. The returns of Steps 1 and 5 deliver the desired element. At Step 
6, X and k are chosen so that the kth smallest element of X is the kth smallest element 
of X, and h < n (since u, v ^ X). Also \S\ < n for the recursive calls at Step 3. 

(b) When Step 5 returns u (or v), Select may also return information about the 
positions of the elements of X relative to u (or v). For instance, if X is stored as an array, 
its k smallest elements may be placed first via interchanges at Step 4 (cf. §|6|). Hence after 
Step 3 finds u, we may remove from S its first i u smallest elements before extracting v. 
Further, Step 4 need only compare u and v with the elements of X \ S. 

(c) The following elementary property is needed in §|j. Let c n denote the maximum 
number of comparisons taken by Select on any input of size n. Since Step 3 makes at 
most c s + c s _j u comparisons with s < n, Step 4 needs at most 2(n — s), and Step 7 takes 
at most with n < n, by induction c n < oo for all n. 



3 Preliminary analysis 

In this section we analyze general features of sampling used by Select. 



3.1 Sampling deviations and expectation bounds 

Our analysis hinges on the following bound on the tail of the hypergeometric distribution 
established in [|Hoe63|l and rederived shortly in [ Chv79|| . 

Fact 3.1. Let s balls be chosen uniformly at random from a set of n balls, of which r 
are red, and r' be the random variable representing the number of red balls drawn. Let 
p := r/n. Then 

P[r' >ps + g] < e- 2g2/s Vg > 0. (3.1) 

We shall also need a simple version of the (left) Chebyshev inequality |[Kor78| , §2.4.2]. 

Fact 3.2. Let z be a nonnegative random variable such that P[z < £] = 1 for some 
constant (. Then Ez < t + (P[z > t] for all nonnegative real numbers t. 



3.2 Sample ranks and partitioning efficiency 

Denote by x\ < . . . < x* n and y\ < . . . < y* the sorted elements of the input set X and 
the sample set S, respectively. Thus x* k is the fcth smallest element of X, whereas u = y* 
and v = y* at Step 3. This notation facilitates showing that for the bounding indices 

k\ := max { \k — Ignj s~\ , 1 } and k r := min { \k + 2gn/s\ , n } , (3-2) 

we have x* ki < u < x* k < v < x* kr with high probability for suitable choices of s and g. 
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Lemma 3.3. (a) P[x* k < u] < e 2g2/s if i u = \ks/n - g]. 

(b) P[u < 4 ( ] < e~ 2g2 l s . 

(c) P[v < x* k ] < e~ 2g2 l s if i v = \ks/n + g\ . 

(d) P[x* kr < v] < e- 2 'J 2 l s . 

(e) i u ^ \ks/n-g\ iff k < gn/s; i v ^ \ks/n + g] iff n < k + gn/s. 

Proof, (a) If x k < y* u , at least s — i u + 1 samples satisfy yi > with J := max x . =x « j. 
In the setting of Fact [3.1| , we have r := n — J red elements Xj > ps = s — Js/n 

and r' > s — i u + 1. Since i u = \ks/n — g] < ks/n — g + 1 and J > k, we get r' > 
P s + (J — ^)s/ri + g > ps + g. Hence P[x* k < u] < P[r' > ps + g] < e~ 2g2/s by (|3~T|) . 

(b) If y* < xt , % u samples are at most x*, where r := max x . <x * j. Thus we have r red 
elements Xj < x*, ps = rs/n and r' > i u . Now, 1 < r < ki — 1 implies 2 < ki = \k — 2gn/s~\ 
by ( |3.2| ) and thus ki < k — 2gn/s + 1, so —rs/n > —ks/n + 2g. Hence i u — ps — g > 
ks/n — g — rs/n — g > 0, i.e., r' > ps + g\ invoke (|3.1| ) as before. 

(c) If y* v < x* k) i v samples are at most x*, where r := max x * <:r * j. Thus we have r red 
elements Xj < x*, ps = rs/n and r' > i v . But i v — ps — g > ks/n + g — rs/n — g > 
implies r' > ps + g, so again (|3.1| ) yields the conclusion. 

(d) If xt < yf , s— i v +l samples are at least iL,, where J := max I » =I « j. Thus we have 

T V ' J<~ ' j fey 

r := n — J red elements Xj > = s—Js/n and r' > s — ^ + 1. Now, i„ < ks/n + g + 1 

and ]> k r > k + 2gn/s (cf. (|3!2|)) yield s — + ps — > Js/n — ks/n — g — 1 + 1— g > 0. 
Thus < v implies r' > ps + g>; hence P[x^ r < v] < P[r' > ps + g] < e~ 2g ^ s by ( |3~T1) . 

(e) Follows immediately from the properties of [•] |[Knu97| §1.2.4]. □ 

We may now estimate the partitioning costs of Step 4. We assume that only necessary 
comparisons are made (but it will be seen that up to s extraneous comparisons may be 
accomodated in our analysis; cf. Rem. |5T4](a)). 

Lemma 3.4. Let c denote the number of comparisons made at Step 4. Then 

P[ c < c] > 1 - e- 2g2/s and Ec<c + 2(n- s)e~ 2g2/s with (3.3a) 

c := n + min{ k,n — k} — s + 2gn/s. (3.3b) 

Proof. Consider the event A := {c < c} and its complement A 1 := {c > c}. If u = v then 
c = n — s < c; hence P[A'} = P[A' H {u < v}], and we may assume u < v below. 

First, suppose k < n/2. Then c = n — s + |{x G X \ S : a; < u}|, since n — s 
elements of X \ S are compared to v first. In particular, c < 2(n — s). Since k < n/2, 
c = n + k — s + 2gn/s. If v < x kr) then {x E X \ S : x < v} C {x <E X : x < v} \ {u, v} 
yields \{x G X \ S : x < v}\ < k r — 2, so c < n — s + k r — 2; since k r < k + 2gn/s + 1, we 
get c<7i + A; — s + 2^72/ s — 1 < c. Thus u < v < x* kr implies A. Therefore, A' C\ {u < v} 
implies {x* kr < v} n {u < v}, so P[A' n {u < v}] < P[x* kr < v] < e~ 2g2 l s (Lem. |J(d)). 
Hence we have ( |3.3|) , since Ec < c + 2{n — s)e~~ 2a2 / s by Fact |3]2] (with z := c, ( := 2(n — s)). 

Next, suppose k > n/2. Now c = n — s + G X \ S* : u < x}\, since n — s elements 
of X \ S are compared to u first. If x* k < u, then {x <E X \ S : u < x} c{xGX: 
u < x} \ {u,v} yields \{x G X \ S : u < x}\ < n — k[ — 1; hence k\ > k — 2gn/s gives 
c < n — s + (n — k) + 2gn/s — 1 < c. Thus ^4' D {m < f } implies {u < x*} H {m < f }, so 
P[^' n {w < u}] < P[u < x* h ] < e~ 2g2 ' s (Lem. ^(b)), and we get Q as before. □ 



5 



The following result will imply that, for suitable choices of s and g, the set X selected 
at Step 6 will be "small enough" with high probability and in expectation; we let X := 
and n := if Step 5 returns u or v , but we don't consider this case explicitly. 

Lemma 3.5. P [n < Agn/s] > 1 — Ae~ 29 ^ s , and En < Agn/s + Ane~ 2g2 l s . 



Proof. The first bound yields the second one by Fact |3.2| (with z := n < n). In each case 
below, we define an event £ that implies the event B := {h < Agn/s}. 

First, consider the middle case of i u = \ks/n — g\ and i v = \ks/n + g\. Let £ := {x* ki < 



u < x* k < v < x* kr }. By Lem. [O] and the Boole-Benferroni inequality, its complement £' 
has P[£'\ < Ae- 2g2/s } so P[£] > 1 - 4e~ 2fl2/s . By the rules of Steps 4-6, u < x\ < v implies 
X = C, whereas x* ki < u < v < x* kr yields h < k r — ki + 1 — 2; since k r < k + 2gn/s + 1 
and ki > k — 2gn/s by (p^2|), we get ri < Agn/s. Hence £ C B and thus P[£>] > P[^]. 

Next, consider the left case of i u ^ \ks/n — g], i.e., /c < (Lem. |3~3|(e)). If 

i v 7^ \ks/n + g] , then n < k + gn/s (Lem. |Q|(e)) gives n < n < k + gn/s < 2gn/s; take 
£ := {n < k + gn/s}, a certain event. For i v = \ks/n + g~\ , let £^ := {s^ < t> < x* kr }; again 
P[f ] > 1 — 2e~ 2g2 / s by Lem. |0|(c,d). Now, < v implies X C AU C, whereas v < x* kr 
gives n < k r — 1 < k + 2gn/s < 3gn/s; therefore £ C B. 

Finally, consider the right case of i v ^ \ks/n + g}, i.e., n < k + gn/s. \ii u ^ \ks/n — g\ 
then k < gn/s gives h < n < 2gn/s; take £ := {k < gn/s}. For i u = \ks/n — g~\, 
£ ■= { x * ki < u < x* k } has P[£] > 1 - 2e- 29 ^ s by Lem. p](a,b). Now, u < x* k implies 

X G C U E, whereas x* k < u yields h < n — ki with ki > k — 2gn/s and thus n < 3gn/s. 
Hence £ C B. U 

Corollary 3.6. P [c < c and h < Agn/s] > 1 - 4e~ 2f?2/s . 

Proof. Check that £ implies A in the proofs of Lems. pJTl and R73t note that n < 2gn/s 



yields c <2{n — s) < c (cf. ( |3.3b| )) in the left and right subcases. □ 

Remark 3.7. Suppose Step 3 resets i u := i v if k < gn/s, or i v := i u ii n < k + gn/s, 
finding a single pivot u — v in these cases. The preceding results remain valid. 

4 Analysis of the recursive version 

In this section we analyze the average performance of Select for various sample sizes. 
4.1 Floyd-Rivest's samples 

For positive constants a and (3, consider choosing s = s(n) and g = g(n) as 

s := mm{\af(n)],n - 1} and g := ((3s hm) 1/2 with f(n) := n 2/3 ln 1/3 n. (4.1) 



This form of g gives a probability bound e 2g I s = n 2/3 for Lems. |3.4| - [3.5| . To get more 



feeling, suppose a = (3 = 1 and s = f(n). Let 0(n) := f(n)/n. Then s/n = g/s = 4>(n) 
and n/n is at most 40(n) with high probability (at least 1—A/n 2 ), i.e., 0(n) is a contraction 
factor; note that <p(n) « 2.4% for n = 10 6 (cf. Tab. fO|). 
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Table 4.1: Sample size f(n) := n 2 / 3 In 1 / 3 n and relative sample size 0(n) := f(n)/n. 



n 


10 3 


10 4 


10 5 


10 6 


5 • 10 6 


10 7 


5 • 10 7 


10 8 


f(n) 


190.449 


972.953 


4864.76 


23995.0 


72287.1 


117248 


353885 


568986 


0(n) 


.190449 


.097295 


.048648 


.023995 


.014557 


.011725 


.007078 


.005690 



Theorem 4.1. Let C n k denote the expected number of comparisons made by Select for 
s and g chosen as in ([4. 1|) with (3 > 1/6. There exists a positive constant 7 such that 

C nk <n + mm{k,n-k} + >yf(n) VI < k < n. (4.2) 

Proof. We need a few preliminary facts. The function <fi(t) := f(t)/t = (lnt/t) 1//3 de- 
creases to on [e, 00), whereas f(t) grows to infinity on [2, 00). Let 5 := 4(/3/a) 1 / 2 . Pick 
n > 3 large enough so that e — 1 < af{n) < n — 1 and e < 5f{n). Let a := a + 1/ f{n). 
Then, by ( J4.ip and the monotonicity of / and <ft, we have for n > n 

s<af(n) and f(s)<a<t>(af(n))f(n), (4.3) 

f(5f(n)) < 5<f>(5f(n))f(n). (4.4) 
For instance, the first inequality of (|4.3|) yields f(s) < f(af(n)), whereas 

f(af(n)) = a(f)(af(n))f(n) < acj)(af(n))f(n). 



Also for n > n, we have s = \af(n)] = af(n) + e with e G [0,1) in ( [4.1|) . Writing 
s = af{n) with a := a + e/ f{n) £ [a, a), we deduce from ( |4.1| ) that 

= (/?/«) 1/2 /(^) < (/5/«) 1/2 /(^)- (4.5) 
In particular, Agn/s < Sf(n), since 5 := 4(/5/a) 1 / 2 . For (3 > 1/6, Q implies 

ne- 2fl2/s < n x - 2/3 = /(n)n 1/3 " 2/3 ln" 1/3 n < /(n) ln" 1/3 n. (4.6) 
Using the monotonicity of <fi and / on [e, 00), increase n if necessary to get 

2a<f)(af(n)) + 8<f>(5f(n)) + 40(n)n 1/3 " 2/3 ln~ 1/3 n < 0.95. (4.7) 



By Rem. |2.2j (c), there is 7 such that (|4.2j ) holds for all n < n; increasing 7 if necessary, we 
have 

2a + 25 + 8n 1/3 ~ 2f3 ln" 1/3 n < O.O57. (4.8) 

Let n' > n. Assuming ( |4.2| ) holds for all n < n', for induction let n = n' + 1. 

The cost of Step 3 can be estimated as follows. We may first apply Select recursively 
to S to find u = y* u , and then extract v = y* v from the elements y* +1 , ■ ■ ■ ,Vs (assuming 
i u < it,; otherwise v = u). Since s < n', the expected number of comparisons is 

C siu + CUi^-i, < 1.5s + 7/(s) + l-5(a - i«) + 7/(a - < ^ - 1.5 + 2 7 /(s). (4.9) 
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The partitioning cost of Step 4 is estimated by ( |3.3| ) as 

Ec < n + min{ k,n- k} - s + 2gn/s + 2ne~ 2g2/s . (4.10) 

The cost of finishing up at Step 7 is at most < l.Bn + jf(n). But by Lem. |3~5l , 
P[n > 4gn/s] < 4e~ 29 ^ s , and n < n, so (cf. Fact |3]2| with 2 := 1.5n + n ff(n)) 

E [ 1.5n + 7/(n) ] < 1.5 • 4#n/s + jf(4gn/s) + [ l.Bn + 7/(71) ] 4e _252/s . 

Since Agn/s < 5f(n), f is increasing, and /(ra) = <p(n)n above, we get 

EC ft £ < 6gn/s + jf{5f(n)) + [ 1.5 + 7 0(n) ] 4ne" 2s2/s . (4.11) 

Add the costs (fO|, QCTDD and ([CTT]) to get 

C„ fc < 3s - 1.5 + 27/(5) + n + min{ k,n - k} - s + 2gn/s + 2ne~ 2ff2/s 
+ Qgn/s + jf(Sf(n)) + [ 1.5 + j<p(n) ] Ane~ 292/s 
< n + min{ k,n - k} + 2s + 8gn/s + &ne~ 29 ' 2,s (4.12a) 

+ 7 [2f(s) + f{Sf{n)) + 4ne- 292/s <j){n) ] . (4.12b) 

By (l^)-(U), the bracketed term in ( jT2a| ) is at most 0.057/(n) due to ([Of), and that 
in (|4.12b|) is at most 0.95/ (n) from ( |4.7|) ; thus ( f4.2|) holds as required. □ 

We now indicate briefly how to adapt the preceding proof to several variations on ( |4.1| ); 
choices similar to ( [4.13|) and (^4.17|) are used in [|Meh00|| and |[FlR75a|| , respectively. 

Remarks 4.2. (a) Theorem |4.1| holds for the following modification of ( |4.1|) : 

s := min { \af(n)] ,n-l} and g := ([3s In 9s) 1/2 with f(n): = n 2/3 ln 1/3 n, (4.13) 
provided that (3 > 1/4, where 9 > 0. Indeed, the analogue of Q (cf. (PD , (pi~13|) ) 

gn/s = (l3/a) 1/2 f(n)(\n6s/\nn) 1/2 < (/3/a) 1/2 f(n)(\n6s/ \nn) l/2 (4.14) 
works like ( |4.5| ) for large n (since lim^oo = 2/3), whereas replacing ( |4. 6| ) by 

ne -^h = n ( 9s )-W < /( n )(^)-2/3 n (l-4/3)/3 ln -(l+2/3)/3 n) (4 ^ 5) 

we may replace n 1 / 3 - 213 by (a^)- 2 ^ 1 " 4 ^ 3 in (Q-Q. 

(b) Theorem |4.1| holds for the following modification of ( [4.1| ): 

s := min { \af(n)] ,n-l} and g := (/5s ln £i n) 1/2 with f(n) := n 2/3 ln e;/3 n, (4.16) 



provided either e; = 1 and (3 > 1/6, or e/ > 1. Indeed, since ( |4.16|) = (|4~1~| ) for e; = 1, 
suppose e« > 1. Clearly, (^3|) - (^75|) hold with <f)(t) := /(t)/t. For ^ > 1/6 and n large 
enough, we have g 2 /s = (3ln €l n > (3 Inn; hence, replacing 2(3 by 2/3 and In -1 / 3 by ln _e ^ 3 
in (|4.6|) - (|4.8|) , we may use the proof of Thm |4.1| . 
(c) Theorem |4.1| remains true if we use f3 > 1/6, 



s :- 



mm 



in I 



an 



2/3' 



l} , 9 := (/?slnn) 1/2 and f(n) := n 2/3 ln 1/2 



n. 



(4.17) 



Again (p[)-(P|) hold with <p(t) := f(t)/t, and ln~ 1/2 replaces ln" 1/3 in (p[)-(P 
(d) None of these choices gives f(n) better than that in ( |4.1| ) for the bound 
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4.2 Reischuk's samples 

For positive constants a and (3, consider using 

1 /2 

s := min { [an £a ] , n — 1 } and g := ( (3sn e ) 1 with 
r] : = max { 1 + (e — e s )/2, e s } < 1 for some fixed < e < e s . 



(4.18a) 
(4.18b) 



Theorem 4.3. Let C n k denote the expected number of comparisons made by Select for 
s and g chosen as in (f4.18|) . There exists a positive constant such that for all k < n 



Cnk <n + min{ k, n — k } + ^f^n) with f n (n) := n v . 



(4.19) 



Proof. The function f n (t) := t v grows to oo on (0, oo), whereas 4> n (t) := f v (t)/t = f _1 
decreases to 0, so f n and (fi n may replace / and <fi in the proof of Thm £44]. Indeed, 
picking n > 1 such that an Es < n — 1, for n > n we may use s = an €s < df v (n) with 
a < a < a := 1 + 1 /n 6s to get analogues ([4.3| )- ([4.4| ) and the following analogue of (|4.5| ) 



gn/s = {(3/a) l ' 2 n l+ ^ 2 < {(3 / 'a) 1 ' 2 f v (n). 



(4.20) 



Since g 2 /s = f3n e by ( |4.18| ), and te decreases to for t > t„ := {\^) ^ , we may 

replace (|4.6|) by 



ne- 2 ^ s = ne~ 2 ^ < n 



l-r, -2/?n f 



f v {n) > n > t„. 



(4.21) 



Hence, with n 1 17 e 2/3n£ replacing n 1 / 3 2 ^ln 1//3 n in ( |4.7|) -( ^8|) , the proof goes through. □ 



Remarks 4.4. (a) For a fixed e G (0, 1), minimizing rj in ( [4.18j ) yields the optimal sample 
size parameter 

e s := (2 + e)/3, (4.22) 

with r] = e s > 2/3 and /^(n) = n (2+e)/3 ; note that if s = an es in (|4.18|) , then g = (a(3) l/2 n t9 
with e g := (1 + 2e)/3. To compare the bounds ( |4.2j ) and ( [4 . 1 9[ ) for this optimal choice, let 
$ e (t) := (tyint) 1 / 3 , so that $ £ (t) = f n {t)/f{t) = 0„(t)/0(t). Since lim™ $ e (n) = oo, 
the choice ( |4.1| ) is asymptotically superior to ( ^.18| ). However, $ e (n) grows quite slowly, 
and § e (n) < 1 even for fairly large n when e is small (cf. Tab. |4.2| ). On the other hand, 
for small e and [3 = 1, the probability bound e~ 29 / s = e _2riE of (|4.18| ) is weak relative to 



e -2g /s _ ^-2 ensurec l by (p~l~|), 

(b) Consider using s := min{ [an £s ] , n — 1} and g := (3 l l 2 n ea with e s ,e g G (0, 1) such 
that e := 2e 3 — e s > and 77 := max{l + e fl — e s , e s } < 1. Theorem covers this choice. 
Indeed, the equality 1 + e g — e s = 1 + (e — e s )/2 shows that (|4.18b|) and ( |4.20| ) remain valid, 



and we have the following analogue of ( |4.21| ) 



ne- 2g2/s < n^e-W^'fntn) Vn > n > [(1 - rj)a/{2f3e)] 



(4.23) 



so compatible modifications of ( |4.7|) -( pi~8D suffice for the rest of the proof. Note that 
n > (2 + e)/3 by (a); for the choice e s = |, e g = ^ of ||Kei85| | , e = | and 77 = j|. 
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Table 4.2: Relative sample sizes $ e (n) and probability bounds e 





* e (n) := 


071ni) 


1/3 








exp(- 


2n e ) 










n 


10 5 10 6 


5 • 10 6 


10 7 




10 5 




10 6 


5 


IO 6 




IO 7 




1/4 


1.16 1.32 


1.45 


1.52 


3.6 


• IO" 16 


3.4 


■ IO" 28 


8.4 


1Q -42 


1.4 


■ io- 


49 


€ 1/6 


.840 .898 


.946 


.969 


1.2 


• IO" 6 


2.1 


• io- 9 


4.4 


HT 12 


1.8 


• io- 


12 


1/9 


.678 .695 


.711 


.719 


7.6 


• IO" 4 


9.3 


• IO" 5 


1.5 


IO -5 


6.2 


• 10" 


6 



4.3 Handling small subfiles 

Since the sampling efficiency decreases when X shrinks, consider the following modifica- 
tion. For a fixed cut-off parameter n cut > 1, let sSelect(X, k) be a "small-select" routine 
that finds the kth smallest element of X in at most C cut < oo comparisons when \X\ < n cu t 
(even bubble sort will do). Then Select is modified to start with the following 

Step (Small file case). If n := \X\ < n cnt , return sSelect(X, k). 

Our preceding results remain valid for this modification. In fact it suffices if C cn t 
bounds the expected number of comparisons of sSelect(X, k) for n < n cu t- For instance, 
Q4.2D holds for n < n cnt and 7 > C cut , and by induction as in Rem. [2~2|(c) we have C n k < 00 
for all n, which suffices for the proof of Thm |4.1| . 

Another advantage is that even small n cut (1000 say) limits nicely the stack space for 
recursion. Specifically, the tail recursion of Step 7 is easily eliminated (set X := X, k := k 
and go to Step 0), and the calls of Step 3 deal with subsets whose sizes quickly reach n cut . 
For example, for the choice of fl4.1| ) with a = 1 and n cut = 600, at most four recursive 
levels occur for n < 2 31 w 2.15 ■ 10 9 . 



5 Analysis of nonrecursive versions 

Consider a nonrecursive version of Select in which Steps 3 and 7, instead of Select, em- 
ploy a linear-time routine (e.g., Pick ||BFP + 72| ) that finds the ith smallest of m elements 
in at most 7pm comparisons for some constant 7p > 2. 

Theorem 5.1. Let c n k denote the number of comparisons made by the nonrecursive ver- 
sion of Select for a given choice of s and g. Suppose s < n — 1. 

(a) For the choice of fl4.1|) with f[n) := n 2//3 ln 1//3 n, we have 

P [ Cnk < n + min{ k, n — k } + 7p/(n) ] > 1 — An~ 213 with (5.1a) 

7 p := (4 7P + 2)(/5/a) 1 / 2 + (2 7p - 1) [a + l//(n)] , (5.1b) 
also with f(n) in (|5.1b|) replaced by /(3) > 2 (since n > 3). Moreover, if (3 > 1/6, then 

Ec nfc < n + min{ k, n - k } + ( 7p + 4 7p + 2 ) f(n). (5.2) 

(b) For the choice of ( |4.13j ), if 6s < n, then ( |5.1a| ) holds with n~ 2/3 replaced by 
(a^)- 2/3 n- 4/3/3 ln" 2/3/3 n. Moreover, if (3 > 1/4, then (fT^) holds with A^ P + 2 replaced 
by (4 7 p + 2)(a#)- 2/3 . 
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(c) For the choice of ( |4.18| ), ( |5.1| ) holds with f(n) replaced by f v {n) := n v and n 2/3 
by e~ 2f3nC . Moreover, if n 1_,? e _2/3nE < 1, then (|5.2| ) holds with f replaced by f v . 



Proof. The cost c n k of Steps 3, 4 and 7 is at most 2^ps + c + r yph. By Cor. p.6|, the event 
C := {c < c,n < Agn/s} has probability P[C] > 1 - Ae~ 29 ^ s . If C occurs, then 



c n k < n + min{ k,n — k} — s + 2gn/s + 2^ P s + 7p|_4gn/sJ 
< n + min{ /c, n — A; } + ( 47p + 2 ) gn/s + ( 27p — 1 ) s. 



(5.3) 



Similarly, since Ec n fc < 27ps + Ec + 7pEn, Lems. |3.4j^3.5| yield 
Ec nfc < n + min{ fc, n - k } + ( 47 P + 2 ) gn/s + ( 27 P - 1 ) s + ( 47 P + 2 ) ne~ 2!,2/s . (5.4) 



fa) Since e^ 5 



-2/3 



s = |~a;/(n)] < af(n) from s < n — 1 and ( f4.3|) , and gn/s is 



bounded by flOD , (|J) implies (p|). Then (gj) follows from O and 

(b) Proceed as for (a), invoking ( [4.14j )- (|4.15| ) instead of ( |4.5| ) and ( |4.6| ). 

(c) Argue as for (a), using the proof of Thm Ol, in particular (|4.20|)-(|4.21|). □ 



Corollary 5.2. The nonrecursive version of Select requires n + min{/c, n — k} + o(n) 
comparisons with probability at least 1 — An~ 2 ^ /or t/ie choice of ( f4.1p ; at /east 1 — 
4(a^)- 2/3 n- 4/3 / 3 /or tne caozce of Q4T3D , and at /east 1 - 4e~ 2/3nE /or tae c/iozce o/ ( ^18|) . 



Remarks 5.3. (a) Suppose Steps 3 and 7 simply sort 5* and X by any algorithm that 
takes at most 7s(slns + nlnn) comparisons for a constant 75. This cost is at most 
(s + n)7slnn, because s,h < n, so we may replace 27P by 7s Inn and 47p by 475 Inn in 
(PD-H), and hence in H)-(PD- F o r the choice of Q4l|), this yields 



P [c n k < n + min{ fc, n — A; } + 7s/(n) Inn] > 1 — 4n 2/3 with (5.5a) 

7s := (4 75 + 2 In" 1 n) (P/a) 1 ' 2 + (75 - hi" 1 n) [a + l/f(n)} , (5.5b) 
Ec n fc < n + min{ k,n — k} + (75 + 475 + 2 ln^ 1 n ) /(ra) Inn, (5.6) 

where In -1 n may be replaced by In -1 3, and ( [5.6D still needs /3 > 1/6; for the choices ( |4 . 1 3| ) 
and ( |4.18| ), we may modify ( |5.5| )-( |5T6| ) as in Thm |57^(b,c) . Corollary |5.2j remains valid. 



(b) The bound ( |5.2|) holds if Steps 3 and 7 employ a routine (e.g., Find ||Hoa61 



AH U 74j , §3.7]) for which the expected number of comparisons to find the ith smallest of 



m elements is at most 7pm (then Ec n/ t < 27ps + Ec + 7pEn is bounded as before). 



(c) Suppose Step 6 returns to Step 1 if n > Agn/s. By Cor. [T6], such loops are finite 
wp 1, and don't occur with high probability, for n large enough. 



(d) Our results improve upon |CeS96| , Thm 2], which only gives an estimate like ( p.lap , 



but with An 2/3 replaced by 0(w} 2 ^/ 3 ) ; a much weaker bound. Further, the approach of 
GeS96| , §3] is restricted to distinct elements. 



We now comment briefly on the possible use of sampling with replacement. 
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Remarks 5.4. (a) Suppose Step 2 of Select employs sampling with replacement. Since 
the tail bound (|3.1| ) remais valid for the binomial distribution [|Chv79| , [Hoe63|| , Lemma |3]3 
is not affected. However, when Step 4 no longer skips comparisons with the elements of 
S, -s in (pi) and flOoD is replaced by (cf. the proof of Lem. 2s in fl£l2aD by 3s 
and 2d in (|4.8| ) by 3a. Similarly, adding s to the right sides of (|5.3|) -(|5~4]) boils down to 
omitting —1 in (|5.1b|) and — In -1 n in (|5.5b|) . Hence the preceding results remain valid. 

(b) Of course, sampling with replacement needs additional storage for S. This is 
inconvenient for the recursive version, but tolerable for the nonrecursive ones because the 
sample sizes are relatively small (hence ( |3.3|) with —s omitted is not too bad). 

(c) Our results improve upon ||MoR95| , Thm 3.5], corresponding to ( 4.18|) with e = 1/4 
and (3=1, where the probability bound 1 — 0(n -1 / 4 ) is weaker than our 1 — 4e~ 2nl/4 , 
sampling is done with replacement and the elements are distinct. 

(d) Our results subsume ||MehOO| , Thm 2], which gives an estimate like (|5.2|) for the 
choice ( |4.13| ) with (3=1, using quickselect (cf. Rem. p.3| (b)) and sampling with replacement 
in the case of distinct elements. 



6 Ternary and quintary partitioning 



In this section we discuss ways of implementing Select when the input set is given as 
an array x[l: n\. We introduce a modification of the ternary partitioning scheme |[BelV193 



BeS97|| that obviates subscript range checking, and give extensions to quintary partitioning. 

The following notation is needed to describe the operations of Select in more detail. 

Each stage works with a segment x[l: r] of the input array x[l: n], where 1 < I < r < n 
are such that x^ < X\ for i = 1:1 — 1, x r < Xi for i = r + l:n, and the kth smallest 
element of x[l:n] is the (k — I + l)th smallest element of x[/:r]. The task of Select is 
extended: given x[l:r] and I < k < r, Select(x, I, r, k, k_, k + ) permutes x[l:r] and finds 
I < k_ < k < k + < r such that Xi < Xk for all I < i < k_, Xi = x^ for all k_ < i < k + , 
Xi > Xk for all k + < i < r. The initial call is Select(x, 1, n, k, k-, k + ). 

A vector swap denoted by x[a: b] «-> x[6+l: c] means that the first d := min(6+l— a, c—b) 
elements of array x[a: c] are exchanged with its last d elements in arbitrary order if d > 0; 



e.g., we may exchange x a+i 



for < i < d, or x a+i 



x, 



c-d+l+i 



for < i < d. 



6.1 Ternary partitions 

For a given pivot v := x^ from the array ar[/:r], the following ternary scheme partitions 
the array into three blocks, with x m < v for / < m < a, x m = v for a < m < d, x m > v for 
d < m < r. The basic idea is to work with the five inner parts of the array 



X < V 


X = V 


X < V 


? 


X > V 


X = V 


X > V 


I 


I 


p 


* j 


q 


r 


r 



until the middle part is empty or just contains an element equal to the pivot 



X = V 


X < 


V 


X = V 


X > V 


X = V 


I 


V 


J 




i q 


f 



(6.1) 



(6.2) 
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(i.e., j = i — 1 or j = i — 2), then swap the ends into the middle for the final arrangement 



x < v 



x 



X > V 



I 



(6.3) 



Al. [Initialize.] Set v := Xk and exchange x\ <-» Set i :— I :— I, p :— I + 1, j := r := r, 
g := r — 1. If v < x r , set f := r — 1. If t> > x r , exchange a;/ <-> x r and set I :— I + 1. 

A2. [Increase i until Xj > v.] Increase % by 1; then if x^ < v, repeat this step. 

A3. [Decrease j until Xj < v.] Decrease j by 1; then if Xj > v, repeat this step. 

A4. [Exchange.] (Here Xj < v < X{.) If % < j, exchange Xi Xj] then if Xi = v, exchange 
Xi <-> x p and increase p by 1; if Xj — v, exchange Xj <-> x g and decrease q by 1; return 
to A2. If z = j (so that Xj = Xj = v), increase % by 1 and decrease j by 1. 

A5. [Cleanup.] Exchange x[l:p — 1] <-> x[p:j] and x[i:g] <-> x[g + l:f]. Finally, set 
a := Z + j — p + 1 and d := f — q + i — 1. 

Step Al ensures that < v < x r , so steps A2 and A3 don't need to test whether % < j; 
thus their loops can run faster than those in the schemes of |BcM93j , [BcS971| and ||Knu97| , 
Ex. 5.2.2-41] (which do need such tests, since, e.g., there may be no element Xi > v). 



6.2 Preparing for quintary partitions 

At Step 1, r — I + 1 replaces n in finding s and g. At Step 2, it is convenient to place the 
sample in the initial part of x[l: r] by exchanging Xi <-> Xi+ ran d(r-i) for / < i < r s := l+s — 1, 
where rand(r — i) denotes a random integer, uniformly distributed between and r — i. 

Step 3 uses k u := max{ |7 — 1 + is/m — g] , 1} and k v := min{ \l — 1 +is/m + g~\ , r s } with 
i := k — l + 1 and m := r — 1 + 1 for the recursive calls. If Select(x, I, r s , k u , k~, returns 

— we have v := u := Xk u , so we only set k~ := k v , k£ := k+ and reset A;+ := k v — 1. 
Otherwise the second call Select(x, fc+ + 1, r s , produces v := x^. 

After u and f have been found, our array looks as follows 



X < u 


X = u 


u < X < V 


X = V 


X > V 


? 


I 






k v fc+ 


r s 


r 



Setting I := k u , p := k+ + 1, r := r — r s + g := r — /c+ + fe„ — 1, we exchange 
x[/c+ + 1: r s ] <r+ x[r s + l:r] and then x[/c~: <-h> x[fc+ + 1: f] to get the arrangement 



X < u 


X = u 


U < X < V 




X = V 


X > V 


I 


I 


p 


q 


f 


r 



(6.5) 



The third part above is missing precisely when u = v ; in this case ( |6.5| ) reduces to ( |6.1D 
with initial p := p, q := q, i := p — 1 and j := q + 1. Hence the case of u = v is handled 
via the ternary partitioning scheme of § |6.1| , with step Al omitted. 
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6.3 Quintary partitions 



For the case of k < |_(r + 1)/2\ and u < v , Step 4 may use the following quintary scheme 
to partition x[l: r] into five blocks, with x m < u for I < m < a, x m = u for a < m < b, 
u < x m < v for b < m < c, x m = v for c < m < d, x m > v for d < m < r. The basic idea 
is to work with the six-part array stemming from (|6.5|) 



X = u 


U < X < V 


X < u 


? 


X > V 


X = V 



p 



p 



until i and j cross 



X = u 


U < X < V 


X < u 


X > V 


X = V 



— ) 

r 



I p p j i q 

we may then swap the second part with the third one to bring it into the middle 



(6.6) 



(6.7) 



X = u 


X < u 


U < X < V 


X > V 


X = V 



(6i 



I p b c i q r 

and finally swap the extreme parts with their neighbors to get the desired arrangement 



X < u 


X = u 


u < X < V 


X = V 


X > V 


I 


a 


b c 


d 


f 



(6.9) 



Bl. [Initialize.] Set p := k v , q := q, i := p — 1 and j := q + 1. 



B2. [Increase i until Xj > v.] Increase i by 1. If Xi > v, go to B3. If < u, repeat this 
step. (At this point, u < Xi < v.) If Xi = u, exchange Xi <-> x p and x p ^ x p and 
increase p by 1; otherwise exchange Xi <-> x p . Increase p by 1 and repeat this step. 

B3. [Decrease j until Xj < v.] Decrease j by 1. If Xj > v, repeat this step. If Xj = v, 



exchange Xj «-> x q , decrease q by 1 and repeat this step. 



B4. [Exchange.] If i > j, go to B5. Exchange x^ 



If Xi = u, exchange x^ 



•' 7, 



x p and 



and x p «-> Xp and increase p and p by 1. Otherwise if x« > u, exchange x« 
increase p by 1. If Xj = v, exchange Xj <-> x g and decrease g by 1. Return to B2. 

B5. [Cleanup.] Set a := I + j — p + 1, b := p — p + i, c := j and cZ := f — q + i — 1. Swap 
x[p: p — 1] x[p: j], x[Z: p — 1] <-> x[p: b — 1], and finally x[z: g] ^> x[g + 1: f]. 

For the case of k > L( r + 0/2j and u < v , Step 4 may use the following quintary scheme, 
which is a symmetric version of the preceding one obtained by replacing ( |6.6[ )-( |6~^ ) by 



X = u 


X < u 


? 


X > f 


M < X < V 


X = V 



I 



p 



X = u 


X < u 


X > V 


U < X < V 


X = V 


I 


p J 


i q 




Q 


f 


X < u 


X = u 


U < X < V 


X > V 


X = V 


I 


a 


b 


c 


Q 


f 



(6.10) 

(6.11) 
(6.12) 



14 



CI. [Initialize.] Set p := p, q := g — k v + k£ + 1, i := p — 1 and j '■= q + 1, and swap 
x\p: k~ — 1] <-> x[£;~: g]. 

C2. [Increase z until Xj > u.} Increase i by 1. If Xj < it, repeat this step. If Xi = it, 
exchange Xi <-> x p , increase p by 1 and repeat this step. 

C3. [Decrease j until Xj < it.] Decrease j by 1. If Xj < u, go to C4. If Xj > v, repeat this 
step. (At this point, u < Xj < v.) If Xj = v, exchange Xj x q and x q <-> x q and 
decrease q by 1; otherwise exchange Xj x q . Decrease q by 1 and repeat this step. 

C4. [Exchange.] If i > j, go to C5. Exchange x, <-» Xj. If Xj = it, exchange x, <-> x p and 
increase p by 1. If Xj — v, exchange Xj <-> Xq and and decrease q and g by 

1. Otherwise if Xj > v, exchange Xj <-> x q and decrease g by 1. Return to C2. 

C5. [Cleanup.] Set a := I + j — p + 1, b := i, c := q — q + j and d := f — q + i — 1. Swap 
x[i: g] <-> x[g + 1: g], x[c + 1: g] «-> x[g + 1: f], and finally x[Z: p — 1] «-> x[p: j]. 



To make (|6.3| ) and (|6.9| ) compatible, the ternary scheme may set b := d + 1, c := a — 1. 
After partitioning / and r are updated by setting I :— b if a < k, then Z := d + 1 if c < k; 
r := c if k < d, then r := a — 1 if k < b. If I > r, SELECT may return fc_ := fc + := 
if Z = r, k- := r + 1 and k + := I — 1 if I > r. Otherwise, instead of calling Select 
recursively, Step 6 may jump back to Step 1, or Step if sSelect is used (cf. § |4.3|) . 

A simple version of sSelect is obtained if Steps 2 and 3 choose it := v := Xk when 
r — / + 1 < n cu t (this choice of |[FlR75a|| works well in practice, but more sophisticated 



pivots could be tried); then the ternary partitioning code can be used by sSelect as well. 

In fact steps A5, B5 and C5 may also share code: resetting q := q and p := p for 
A5, q := q for B5, and p := p for C5, we may swap x[p:p — 1] <-> x[p: j] if p > p, 
x[l: p — 1] <r-> x[p: p — p + j), x[i: q] <-> x[g + 1: q) if g < g, x[i + q — g: g] x[g + 1: f]. 

Even when outcomes of previous comparisons are utilized, our schemes still involve two 
extraneous comparisons (scheme A only one when i — j at A4). Consider, therefore, the 



following alternative to scheme B, also based on the arrangements (|6.6j )- (|£T9| ). 
Dl. [Initialize.] Set p := k~, q := g, i := p, j := q. 

D2. [Increase i until Xj > v.] If i > j or xi > v, go to D3. If x« = it, exchange x, «-> x p 
and x p <-> Xp, and increase p and p by 1; otherwise if Xi > u, exchange Xj <-> x p and 
increase p by 1. Increase i by 1 and repeat this step. 

D3. [Decrease j until x^ < f.] If i > j or Xj < v, go to D4. If Xj = v, exchange Xj <-> x g 
and decrease g by 1. Decrease j by 1 and repeat this step. 

D4. [Exchange.] If i > j, go to D5. Exchange Xj <-> Xj. If Xj = it, exchange Xj <-> x p 
and x p Xp and increase p and p by 1. Otherwise if Xj > it, exchange Xj <-> x p and 
increase p by 1. If Xj = v, exchange Xj x q and decrease q by 1. Increase i by 1, 
decrease j by 1, and return to D2. 

D5. [Cleanup.] Set a := I + j — p + 1, b := p — p + i, c := j and d := f — q + i — 1. Swap 
x[p: p — 1] <-> x[p: j], x[Z: p — 1] <-> x[p: 6 — 1], and finally x[i: g] <-> x[g + 1: f]. 
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Relative to scheme B (which makes r — r s + 2 comparisons to v), scheme D saves two 
u-comparisons at the expense of r — r s + 2 comparisons of % vs. j. Since r — r s 3> 2 
for usual choices of n cut , scheme B is faster than D unless the cost of key comparisons 
is extremely large. Scheme C compares in the same way with a symmetric variant of D. 
The situation with scheme A is similar, even for the small partitions produced in sSelect: 
although the schemes of ]BeM93 , [BeS97|| can save two ^-comparisons, such savings are 
insignificant when relatively few small partitions occur (cf. § [7. 31 ). 



6.4 Poor man's partitions 

We now consider a poor man's version of Select, called pmSelect, which employs less 
refined but hopefully faster partitioning. This version works with x[l:r] such that x,i < xi 
for i — 1:1 — 1, x r < Xi for i — r + 1: n, and its task is standard: given x[l: r] and I < k < r, 
pmSelect(x, /, r, k) permutes x[l: r] so that Xi < Xk for all I < i < k, and Xk < Xi for all 
k < i < r; the initial call is pmSelect(x, 1, n, k). We start with binary partitions. 

For a given pivot v := Xk from the array x[l: r], the following binary scheme partitions 
the array into three blocks, with x m < v for I < m < a, x m = v for a < m < d, v < x m 
for d < m < r; usually a = d and the middle block is singleton. 

El. [Initialize.] Set v := Xk and exchange x\ <-» Xk- Set i :— p :— I and j := r. If v > x r , 
exchange xi ^ x r and set p := r. (Thus v = xp always.) 

E2. [Increase i until Xi > v.] Increase i by 1; then if Xi < v, repeat this step. 

E3. [Decrease j until Xj < v.] Decrease j by 1; then if Xj > v, repeat this step. 

E4. [Exchange.] (Here Xj < v < X{.) If i < j, exchange and return to E2. If 

i = j (so that Xi = Xj = v ), increase i by 1 and decrease j by 1. 

E5. [Cleanup.] If r, exchange set a := j and d := i — 1; otherwise exchange 

Xi «-> Xp, set a := j + 1 and d := i. 

The setup of § |Q| changes as follows. Step 3 calls pmSelect(x, I, r s , k u ) to find u := 
Xk u , and then PMSELECT(x, k u + l,r s ,k v ) to get f := Xk v , assuming k u < k v ; then ( |6.4| ) 
changes to 



X < u 


u 


U < X < V 


V 


X > V 


? 


I 


k u 




k v 


r s 


r 



(6.13) 



Setting I := p := k v , r := r — r s + p, we exchange x[p + 1: r s ] <-> x[r s + 1: r] and then 
x p <-> x f to get the arrangement 



X < u 


u 


U < X < V 


? 




X >v 


I 


I 




p 


f 


r 



(6.14) 



If « = u, setting i := p—1 and j := r, we may use scheme E with step El omitted and E5 
replaced by A5 with q := f — 1; i.e., E5 now reads 

E5. [Cleanup.] Set a := I + j — p + 1 and d := i. Swap x[l: p — 1] <-> x[p: j] and <-> x f . 
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For the case of k < \_(r + l)/2\ and u < v , we may use the following quintary scheme, 
obtained by replacing (|6.6|) - (|6.8|) with the arrangements 



u 



U < X < V X < u 



X > V V 



V 



« 3 



U U < X < V X < U X > V V 



I 



V 3 



u 



X < u 



U < X < V 



X > V 



I 



X < u 



u 



U < X < V 



X > V 



I a b 

Fl. [Initialize.] Set i :— p — 1 and j := f. 



c d 



(6.15) 

(6.16) 
(6.17) 
(6.18) 



F2. [Increase i until Xi > v.] Increase i by 1. If Xi > v, go to F3. If X{ < u, repeat this 
step. (Here u < Xi < v.) Exchange Xi x p , increase p by 1 and repeat this step. 

F3. [Decrease j until Xj < v.] Decrease j by 1. If Xj > v, repeat this step. 

F4. [Exchange.] If % > j, go to F5. Exchange Xj <-> Xj. If Xj > u, exchange Xi <-» x p and 
increase p by 1. Return to F2. 

F5. [Cleanup.] Set a := l + i — p, b := a + 1, c := j and d := j + 1. Swap x[l + hp— 1] «-> 
x[p: j], X;- <-> x a and finally «-> x f . 

In fact scheme F may produce j < % — 1 on the first pass if x m > v for all m > k v — 1. 
For the case of A; > + Z)/2J and u < v , we may use the following quintary scheme, 
which is a symmetric version of scheme F obtained by replacing ( |6.15D — ( |6.17| ) by 



u 



X < u 



X > V 



U < X < V 



i 3 



u 



X < u 



X > V 



U < X < V 



I 



j 



X < u 



u 



U < X < V 



X > V 



(6.19) 
(6.20) 

, , ( 6 - 21 ) 

I j i d r 

Gl. [Initialize.] Set q := r — r s + 1, i := I, j := q + 1, and swap x[l + hp — l] <-> x[p: f — 1]. 

G2. [Increase i until Xj > «.] Increase i by 1. If Xj < tt, repeat this step. 

G3. [Decrease j until Xj < u] Decrease j by 1. If Xj < u, go to G4. If Xj > v, repeat this 
step. (Here u < Xj < v.) Exchange Xj <-> x g , decrease q by 1 and repeat this step. 
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G4. [Exchange.] If % > j, go to G5. Exchange If Xj < v, exchange Xj «-> x g and 

decrease g by 1. Return to G2. 

G5. [Cleanup.] Set a := j, b := a + 1, d := f — q + j and c := d — 1. Exchange xj <-> x a , 
g] <-> x[g + 1: f — 1], and finally «-> x f . 

Also scheme G may produce j < i — 1 on the first pass if x m < v for all m < q + 1. 
Schemes E, F and G are like A, B and C without their equality tests and associated 
updates. When equal elements are absent, the inequalities in (|6.13|) -( |S.21]) are strict, so 



E, F and G (although not equivalent to A, B and C) yield correct partitions for Step 4. 
When equal elements occur, the partitions of E, F and G needn't meet the requirements of 
Step 4, but are still usable. Namely, r — I shrinks when a, b, c and d are used for updating 
I and r, and sSelect may employ scheme E instead of A as in § |5.3| . In effect, pmSelect 
works like Select in the case of distinct elements, but may require more comparisons 
otherwise. In practice pmSelect tends to be slightly faster (cf. §7.3j ). 



7 Experimental results 
7.1 Implemented algorithms 

An implementation of Select was programmed in Fortran 77 and run on a notebook 
PC (Pentium II 400 MHz, 256 MB RAM) under MS Windows 98. The input set X was 
specified as a double precision array. For efficiency, the recursion was removed and small 
arrays with n < n cut were handled as if Steps 2 and 3 chose u := v := Xk~, the resulting 
version of sSelect (cf. §§ [4.3| and |6.3| ) typically required less than 3.5n comparisons. The 
choice of ( |4.1|) was employed, with the parameters a = 0.5, (3 = 0.25 and n cnt = 600 as 
proposed in [ |FlR75a| |; future work should test other sample sizes and parameters. 

A similar implementation of pmSelect was programmed as described in § |6.4| . 

For comparisons we developed a Fortran 77 implementation of the riSelect algorithm 
of ||Val00|| . Briefly, riSelect behaves like quickselect using the median of the first, middle 



and last elements, these elements being exchanged with randomly chosen ones only if the 
file doesn't shrink sufficiently fast. To ensure 0(n) time in the worst case, riSelect may 



switch to the algorithm of ||BFP + 72| , but this never happened in our experiments. Ap- 
parently riSelect represents the state-of-the-art in quickselect implementations (several 
other implementations fared worse in out tests). 

7.2 Testing examples 

We used minor modifications of the input sequences of ||Val00|| , defined as follows: 
random A random permutation of the integers 1 through n. 
onezero A random permutation of \n/2~\ ones and \n/2\ zeroes, 
sorted The integers 1 through n in increasing order. 
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rotated A sorted sequence rotated left once; i.e., (2, 3, ... , n, 1). 

organpipe The integers 1 through n/2 in increasing order, followed by n/2 through 1 in 
decreasing order. 

m3killer Musser's "median-of-3 killer" sequence with n = 4j and k = n/2: 

( 1 2 3 4 ... k-2 k-\ k k + \ ... 2k - 2 2k -I 2k \ 
\l k + 1 3 k + 3 ... 2k - 3 k — 1 2 4 ... 2k - 2 2k - 1 2k ) ' 

twofaced Obtained by randomly permuting the elements of an m3killer sequence in po- 
sitions 4|_log 2 nJ through n/2 — 1 and n/2 + 4|_log 2 nJ — 1 through n — 2. 

For each input sequence, its (lower) median element was found. 

These input sequences were designed to test the performance of selection algorithms 
under a range of conditions. In particular, the onezero sequences represent inputs con- 
taining many duplicates ||Sed77|| . The rotated and organpipe sequences are difficult for 



many implementations of quickselect. The m3 killer and twofaced sequences are hard for 



implementations with median-of-3 pivots (their original versions [|Mus97|| were modified to 



become difficult when the middle element comes from position k instead of k + 1). 
7.3 Computational results 

We varied the input size n from 50,000 to 16,000,000. For the random, onezero and 
twofaced sequences, for each input size, 20 instances were randomly generated; for the 
deterministic sequences, five runs were made to measure the solution time. 

The performance of Select on randomly generated inputs is summarized in Table 



74], where the average, maximum and minimum solution times are in seconds, and the 
comparison counts are in multiples of n; e.g., column six gives C avg /n, where C avg is the 
average number of comparisons made over all instances. Thus 7 avg := (C avg — 1.5n)//(n) 
estimates the constant 7 in the bound (|4.2|); moreover, we have C avg ~ 1.5L avg , where L avg 
is the average sum of sizes of partitioned arrays. Further, P avg is the average number of 
Select partitions, whereas iV avg is the average number of calls to sSelect and p avg is the 
average number of sSelect partitions per call; both P avg and iV avg grow slowly with Inn. 
Finally, s avg is the average sum of sample sizes; s avg / f{n) drops from 0.68 for n = 50K to 
0.56 for n = 16M on the random and twofaced inputs, and from 0.57 to 0.52 on the onezero 
inputs, whereas the initial s/f(n) « a — 0.5. The average solution times grow linearly 
with n (except for small inputs whose solution times couldn't be measured accurately), 
and the differences between maximum and minimum times are fairly small (and also partly 
due to the operating system). Except for the smallest inputs, the maximum and minimum 
numbers of comparisons are quite close, and C avg nicely approaches the theoretical lower 
bound of 1.5n; this is reflected in the values of 7 avg - Note that the results for the random 
and twofaced sequences are almost identical, whereas the onezero inputs only highlight 
the efficiency of our partitioning. 

Table [7.2| exhibits similar features of Select on the deterministic inputs. The results 
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Table 7.1: Performance of Select on randomly generated inputs. 



random 



onczcro 



twofaced 



Size 


Time [sec 


] 




Comparisons [ 


n] 


7 


IVg 


L 


avg 


p 

1 avg 


Navg 


Pavg 


^ avg 


n 


avg 


max 


min 


avg 


max 


min 






M 


[In n] 


[In n] 




[%n] 


50K 


0.01 


0.06 





01 


1 


79 


1 


84 


1 


74 


4 


91 


i 


21 


0.46 


1 


01 


7.40 


4.10 


100K 


0.02 


0.06 





01 


1 


73 


1 


77 


1 


70 


4 


77 


l 


15 


0.43 





96 


8.03 


3.20 


500K 


0.06 


0.11 





05 


1 


62 


1 


63 


1 


61 


4 


06 


i 


08 


0.56 


1 


20 


8.00 


1.86 


1M 


0.12 


0.17 





11 


1 


59 


1 


60 


1 


58 


3 


95 


i 


06 


0.67 


1 


40 


7.95 


1.47 


2M 


0.22 


0.22 





21 


1 


57 


1 


58 


1 


56 


3 


76 


i 


04 


0.76 


1 


59 


7.90 


1.16 


4M 


0.43 


0.44 





38 


1 


56 


1 


56 


1 


55 


3 


63 


i 


03 


0.95 


1 


95 


7.29 


0.92 


8M 


0.83 


0.88 





82 


1 


54 


1 


55 


1 


54 


3 


54 


i 


03 


0.98 


2 


00 


7.41 


0.72 


16M 


1.62 


1.65 


1 


59 


1 


53 


1 


54 


1 


53 


3 


39 


i 


02 


1.00 


2 


05 


7.77 


0.57 


50K 


0.01 


0.06 





01 


1 


52 


1 


52 


1 


50 





25 


i 


02 


0.28 





27 


1.17 


3.41 


100K 


0.02 


0.06 





01 


1 


51 


1 


51 


1 


50 





24 


i 


01 


0.26 





25 


1.24 


2.72 


500K 


0.07 


0.11 





05 


1 


51 


1 


51 


1 


51 





26 


i 


01 


0.23 





23 


1.15 


1.61 


1M 


0.13 


0.17 





11 


1 


51 


1 


51 


1 


51 





26 


i 


01 


0.22 





22 


1.15 


1.29 


2M 


0.27 


0.28 





22 


1 


51 


1 


51 


1 


50 





26 


i 


01 


0.28 





27 


1.09 


1.03 


4M 


0.54 


0.55 





49 


1 


50 


1 


50 


1 


50 





26 


i 


00 


0.33 





26 


1.16 


0.83 


8M 


1.02 


1.05 





98 


1 


50 


1 


50 


1 


50 





26 


i 


00 


0.38 





25 


1.10 


0.66 


16M 


2.04 


2.09 


2 


03 


1 


50 


1 


50 


1 


50 





26 


i 


00 


0.36 





24 


1.13 


0.53 


50K 


0.02 


0.06 





01 


1 


81 


1 


84 


1 


76 


5 


11 


i 


21 


0.46 


1 


02 


7.78 


4.13 


100K 


0.01 


0.06 





01 


1 


73 


1 


77 


1 


71 


4 


81 


i 


16 


0.44 





96 


8.01 


3.20 


500K 


0.07 


0.11 





05 


1 


62 


1 


63 


1 


59 


4 


10 


i 


08 


0.56 


1 


20 


8.15 


1.86 


1M 


0.11 


0.16 





10 


1 


59 


1 


60 


1 


58 


3 


89 


i 


06 


0.64 


1 


36 


7.82 


1.47 


2M 


0.22 


0.27 





22 


1 


57 


1 


58 


1 


56 


3 


63 


i 


04 


0.75 


1 


58 


7.63 


1.16 


4M 


0.42 


0.44 





38 


1 


56 


1 


56 


1 


55 


3 


57 


i 


03 


0.96 


1 


97 


7.29 


0.92 


8M 


0.83 


0.88 





82 


1 


54 


1 


55 


1 


•54 


3 


50 


i 


03 


0.97 


2 


00 


7.43 


0.72 


16M 


1.62 


1.65 


1 


59 


1 


53 


1 


54 


1 


53 


3 


40 


i 


02 


1.00 


2 


03 


7.57 


0.57 



for the sorted and rotated sequences are almost the same, whereas the solution times on the 
organpipe and m3killer sequences are between those for the sorted and random sequences. 

The performance of pmSelect on the same inputs is given in Tables |7T^ and TA. 
Select is slower than pmSelect (but not too much: about 9% on random and twofaced, 
44% on onezero, 16% on sorted and rotated, 13% on organpipe, 10% on m3killer). Except 
for timings and the onezero results, Tables |7.3| - |7.4| almost coincide with |7.1| - |7.2| . 



The performance of riSelect on the same inputs is described in Tables [7.5| and |7T6 
where iV rn d denotes the average number of randomization steps 
the cost of median-of-3 finding is negligible, 
is of order 2.75n IIK1V1P97I 



a ~ 

'-'avg ~ 



-^avgj i.e., 



Note that for riSelect, 
On the random sequences, 



the expected value of C ; 



avg 



but Table |7]5| exhibits significant 
fluctuations in the numbers of comparisons made. The results for the onezero sequences 
confirm that quicksort-like partitioning may handle equal keys quite efficiently | fSed77|| . 
The results for the twofaced, rotated and m3killer inputs are quite good, since some 
versions of quickselect may behave very poorly on these inputs [ Val00| | (note that we used 
the "sorted- median" partitioning variant as suggested in |ValOOj| ). Finally, the median-of-3 
strategy employed by riSelect really shines on the sorted inputs. 

As always, limited testing doesn't warrant firm conclusions, but a comparison of Se- 
lect and riSelect is in order, especially for the random sequences, which are most 
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Table 7.2: Performance of Select on deterministic inputs. 



Sequence 



sorted 



rotated 



organpipe 



m3killer 



Size 


Time [sec 


] 


Comparisons 


Tavg 


L 


avg 


p 

J avg 


avg 


Pavg 


^avg 


n 


avg 


max 


min 


[ 


n] 




[n] 


[In n] 


[Inn] 




[%n] 


50K 


0.02 


0.06 


0.01 


1 


79 


4.91 


1 


23 


0.46 


1.02 


8.36 


4.12 


100K 


0.01 


0.01 


0.01 


1 


73 


4.69 


1 


16 


0.43 


0.96 


8.55 


3.21 


500K 


0.04 


0.06 


0.01 


1 


60 


3.33 


1 


07 


0.61 


1.30 


7.71 


1.86 


1M 


0.09 


0.11 


0.05 


1 


57 


3.07 


1 


06 


0.65 


1.38 


6.58 


1.47 


2M 


0.14 


0.17 


0.11 


1 


56 


2.99 


1 


04 


0.76 


1.59 


7.57 


1.15 


4M 


0.28 


0.28 


0.28 


1 


55 


3.02 


1 


03 


0.99 


2.04 


8.06 


0.92 


8M 


0.55 


0.55 


0.55 


1 


54 


3.12 


1 


03 


1.01 


2.01 


7.13 


0.72 


16M 


1.05 


1.10 


1.04 


1 


53 


3.20 


1 


02 


1.02 


2.11 


7.46 


0.57 


50K 


0.01 


0.06 


0.01 


1 


80 


4.92 


1 


23 


0.46 


1.02 


8.55 


4.12 


100K 


0.01 


0.01 


0.01 


1 


73 


4.69 


1 


16 


0.43 


0.96 


8.55 


3.21 


500K 


0.02 


0.06 


0.01 


1 


60 


3.33 


1 


07 


0.61 


1.30 


7.82 


1.86 


1M 


0.09 


0.11 


0.05 


1 


57 


3.08 


1 


06 


0.65 


1.38 


6.74 


1.47 


2M 


0.15 


0.17 


0.11 


1 


56 


2.99 


1 


04 


0.76 


1.59 


7.13 


1.15 


4M 


0.28 


0.28 


0.27 


1 


55 


3.02 


1 


03 


0.99 


2.04 


7.71 


0.92 


8M 


0.55 


0.55 


0.54 


1 


54 


3.12 


1 


03 


1.01 


2.01 


7.19 


0.72 


16M 


1.05 


1.10 


1.04 


1 


53 


3.20 


1 


02 


1.02 


2.11 


7.46 


0.57 


50K 


0.01 


0.05 


0.01 


1 


83 


5.43 


1 


22 


0.46 


1.02 


8.55 


4.11 


100K 


0.03 


0.06 


0.01 


1 


74 


4.99 


1 


16 


0.43 


0.96 


6.64 


3.19 


500K 


0.04 


0.06 


0.01 


1 


62 


3.97 


1 


07 


0.61 


1.30 


6.65 


1.87 


1M 


0.11 


0.11 


0.11 


1 


59 


3.77 


1 


06 


0.72 


1.52 


7.33 


1.48 


2M 


0.19 


0.22 


0.17 


1 


56 


3.35 


1 


04 


0.76 


1.59 


6.30 


1.16 


4M 


0.34 


0.38 


0.33 


1 


55 


3.32 


1 


03 


0.92 


1.91 


6.79 


0.92 


8M 


0.66 


0.66 


0.65 


1 


54 


2.91 


1 


03 


1.01 


2.08 


7.48 


0.72 


16M 


1.26 


1.27 


1.26 


1 


53 


3.05 


1 


02 


1.02 


2.11 


7.51 


0.57 


50K 


0.01 


0.05 


0.01 


1 


80 


5.05 


1 


22 


0.46 


1.02 


7.91 


4.14 


100K 


0.01 


0.05 


0.01 


1 


74 


4.95 


1 


16 


0.43 


0.96 


6.82 


3.19 


500K 


0.05 


0.06 


0.05 


1 


63 


4.22 


1 


08 


0.61 


1.30 


7.76 


1.86 


1M 


0.11 


0.11 


0.11 


1 


60 


4.06 


1 


06 


0.58 


1.23 


7.94 


1.46 


2M 


0.17 


0.17 


0.16 


1 


57 


3.76 


1 


04 


0.69 


1.45 


8.19 


1.15 


4M 


0.36 


0.39 


0.33 


1 


56 


3.80 


1 


03 


0.99 


2.04 


7.42 


0.92 


8M 


0.69 


0.71 


0.66 


1 


55 


3.60 


1 


03 


0.94 


1.89 


8.60 


0.72 


16M 


1.34 


1.38 


1.32 


1 


54 


3.67 


1 


02 


1.02 


2.05 


7.03 


0.57 



frequently used in theory and practice for evaluating sorting and selection algorithms. On 
the random inputs, the ratio of the expected numbers of comparisons for riSelect and 
Select is asymptotically 2.75/1.5 ~ 1.83; incidentally, the ratio of their computing times 
approaches 3/1.62 « 1.85 (cf. Tabs. [Tj] and [7. 5| ) . Note that Select isn't just asymptoti- 
cally faster; in fact riSelect is about 50% slower even on middle-sized inputs. The same 
slow-down factor of about 50% is observed on the onezero sequences. The performance 
gains of Select over riSelect are much more pronounced on the remaining inputs, ex- 
cept for the sorted sequences on which Select may be slower by up to 88%. (However, 
the sorted input is quite special: increasing k by 1 (for the upper median) doubled the 
solution times of riSelect without influencing those of Select and pmSelect; e.g., for 
n = 16M the respective times were 1.12, 1.07 and 0.90). Note that, relative to riSelect, 
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Table 7.3: Performance of PmSelect on randomly generated inputs. 



random 



onczcro 



twofaced 



Size 


Time [sec 


] 




Comparisons [ 


n] 


7; 


ivg 


L 


avg 


p 

1 avg 


A avg 


Pavg 


S avg 


n 


avg 


max 


min 


avg 


max 


min 






N 


[In n] 


[In n] 




[%n] 


50K 


0.01 


0.06 





.01 


1 


.79 


1 


.84 


1 


.74 


4 


.91 


i 


.21 


0.46 


1.01 


7.40 


4.10 


100K 


0.01 


0.06 





.01 


1 


.73 


1 


.77 


1 


.70 


4 


.77 


l 


.15 


0.43 


0.96 


8.03 


3.20 


500K 


0.05 


0.06 





.05 


1 


.62 


1 


.63 


1 


.61 


4 


.06 


l 


.08 


0.56 


1.20 


8.00 


1.86 


1M 


0.11 


0.11 





.11 


1 


.59 


1 


.60 


1 


.58 


3 


.95 


l 


.06 


0.67 


1.40 


7.95 


1.47 


2M 


0.21 


0.22 





.16 


1 


.57 


1 


.58 


1 


.56 


3 


.76 


l 


.04 


0.76 


1.59 


7.90 


1.16 


4M 


0.39 


0.44 





.38 


1 


.56 


1 


.56 


1 


.55 


3 


.63 


l 


.03 


0.95 


1.95 


7.29 


0.92 


8M 


0.76 


0.77 





.71 


1 


.54 


1 


.55 


1 


.54 


3 


.54 


l 


.03 


0.98 


2.00 


7.41 


0.72 


16M 


1.49 


1.54 


1 


.48 


1 


.53 


1 


.54 


1 


.53 


3 


.39 


l 


.02 


1.00 


2.05 


7.77 


0.57 


50K 


0.01 


0.06 





.01 


1 


.60 


1 


.60 


1 


.58 


1 


.64 


l 


.10 


0.46 


1.01 


5.63 


3.72 


100K 


0.02 


0.06 





.01 


1 


.58 


1 


.58 


1 


.56 


1 


.57 


l 


.08 


0.43 


0.95 


6.06 


2.94 


500K 


0.05 


0.06 





.01 


1 


.54 


1 


.55 


1 


.52 


1 


.46 


l 


.04 


0.66 


1.39 


5.98 


1.79 


1M 


0.07 


0.11 





.05 


1 


.54 


1 


.54 


1 


.54 


1 


.49 


l 


.04 


0.67 


1.42 


6.37 


1.42 


2M 


0.19 


0.22 





.16 


1 


.53 


1 


.53 


1 


.53 


1 


.47 


l 


.03 


0.83 


1.72 


6.28 


1.14 


4M 


0.36 


0.39 





.32 


1 


.52 


1 


.52 


1 


.52 


1 


.42 


l 


.02 


1.43 


2.92 


5.37 


0.92 


8M 


0.71 


0.72 





.71 


1 


.52 


1 


.52 


1 


.52 


1 


.40 


l 


.02 


1.53 


3.13 


5.54 


0.72 


16M 


1.42 


1.43 


1 


.37 


1 


.51 


1 


.51 


1 


.51 


1 


.38 


l 


.01 


1.76 


3.58 


5.54 


0.58 


50K 


0.01 


0.06 





.01 


1 


.81 


1 


.84 


1 


.76 


5 


.11 


l 


.21 


0.46 


1.02 


7.78 


4.13 


100K 


0.02 


0.06 





.01 


1 


.73 


1 


.77 


1 


.71 


4 


.81 


l 


.16 


0.44 


0.96 


8.01 


3.20 


500K 


0.06 


0.11 





.05 


1 


.62 


1 


.63 


1 


.59 


4 


.10 


l 


.08 


0.56 


1.20 


8.15 


1.86 


1M 


0.11 


0.11 





.05 


1 


.59 


1 


.60 


1 


.58 


3 


.89 


l 


.06 


0.64 


1.36 


7.82 


1.47 


2M 


0.20 


0.22 





.16 


1 


.57 


1 


.58 


1 


.56 


3 


.63 


l 


.04 


0.75 


1.58 


7.63 


1.16 


4M 


0.39 


0.44 





.38 


1 


.56 


1 


.56 


1 


.55 


3 


.57 


l 


.03 


0.96 


1.97 


7.29 


0.92 


8M 


0.76 


0.77 





.71 


1 


.54 


1 


.55 


1 


.54 


3 


.50 


l 


.03 


0.97 


2.00 


7.43 


0.72 


16M 


1.49 


1.54 


1 


.48 


1 


.53 


1 


.54 


1 


.53 


3 


.40 


l 


.02 


1.00 


2.03 


7.57 


0.57 



the solution times and comparison counts of SELECT and PMSELECT are much more stable 
across all the inputs. This feature may be important in applications. 

Acknowledgment. I would like to thank Olgierd Hryniewicz, Roger Koenker, Ronald 
L. Rivest and John D. Valois for useful discussions. 
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Table 7.4: Performance of pmSelect on deterministic inputs. 



Sequence 


Size 


Time [sec 


] 


Comparisons 


7avg 




p 

avg 


N 

1 v avg 


Pavg 


3 avg 




n 


avg 


max 


min 


[n] 




[n] 


[Inn] 


[Inn] 




[%n] 


sorted 


50K 


0.01 


0.06 


0.01 


1.79 


4.91 


1.23 


0.46 


1.02 


8.36 


4.12 




100K 


0.03 


0.06 


0.01 


1.73 


4.69 


1.16 


0.43 


0.96 


8.55 


3.21 




500K 


0.06 


0.06 


0.06 


1.60 


3.33 


1.07 


0.61 


1.30 


7.71 


1.86 




1M 


0.07 


0.11 


0.05 


1.57 


3.07 


1.06 


0.65 


1.38 


6.58 


1.47 




2M 


0.12 


0.17 


0.11 


1.56 


2.99 


1.04 


0.76 


1.59 


7.57 


1.15 




4M 


0.25 


0.28 


0.22 


1.55 


3.02 


1.03 


0.99 


2.04 


8.06 


0.92 




8M 


0.46 


0.49 


0.44 


1.54 


3.12 


1.03 


1.01 


2.01 


7.13 


0.72 




16M 


0.90 


0.93 


0.88 


1.53 


3.20 


1.02 


1.02 


2.11 


7.46 


0.57 


rotated 


50K 


0.01 


0.06 


0.01 


1.80 


4.92 


1.23 


0.46 


1.02 


8.55 


4.12 




100K 


0.01 


0.06 


0.01 


1.73 


4.69 


1.16 


0.43 


0.96 


8.55 


3.21 




500K 


0.03 


0.06 


0.01 


1.60 


3.33 


1.07 


0.61 


1.30 


7.82 


1.86 




1M 


0.06 


0.11 


0.05 


1.57 


3.08 


1.06 


0.65 


1.38 


6.74 


1.47 




2M 


0.12 


0.17 


0.11 


1.56 


2.99 


1.04 


0.76 


1.59 


7.13 


1.15 




4M 


0.24 


0.28 


0.22 


1.55 


3.02 


1.03 


0.99 


2.04 


7.71 


0.92 




8M 


0.46 


0.50 


0.44 


1.54 


3.12 


1.03 


1.01 


2.01 


7.19 


0.72 




16M 


0.91 


0.93 


0.88 


1.53 


3.20 


1.02 


1.02 


2.11 


7.46 


0.57 


organpipc 


50K 


0.02 


0.06 


0.01 


1.82 


5.26 


1.21 


0.46 


1.02 


8.73 


4.11 




100K 


0.01 


0.01 


0.01 


1.76 


5.29 


1.17 


0.43 


0.96 


8.64 


3.20 




500K 


0.06 


0.06 


0.06 


1.62 


3.95 


1.07 


0.61 


1.30 


7.06 


1.87 




1M 


0.09 


0.11 


0.05 


1.59 


3.76 


1.06 


0.72 


1.52 


7.43 


1.48 




2M 


0.16 


0.17 


0.16 


1.57 


3.37 


1.04 


0.76 


1.59 


7.00 


1.16 




4M 


0.32 


0.33 


0.27 


1.55 


3.35 


1.03 


0.92 


1.91 


6.90 


0.92 




8M 


0.56 


0.60 


0.55 


1.54 


2.91 


1.03 


1.01 


2.08 


7.97 


0.72 




16M 


1.11 


1.15 


1.10 


1.53 


3.05 


1.02 


1.02 


2.11 


7.34 


0.57 


m3killcr 


50K 


0.01 


0.01 


0.01 


1.80 


5.05 


1.22 


0.46 


1.02 


7.91 


4.14 




100K 


0.01 


0.05 


0.01 


1.74 


4.95 


1.16 


0.43 


0.96 


6.82 


3.19 




500K 


0.05 


0.06 


0.05 


1.63 


4.22 


1.08 


0.61 


1.30 


7.76 


1.86 




1M 


0.09 


0.11 


0.05 


1.60 


4.06 


1.06 


0.58 


1.23 


7.94 


1.46 




2M 


0.17 


0.17 


0.16 


1.57 


3.76 


1.04 


0.69 


1.45 


8.19 


1.15 




4M 


0.33 


0.33 


0.33 


1.56 


3.80 


1.03 


0.99 


2.04 


7.42 


0.92 




8M 


0.61 


0.66 


0.60 


1.55 


3.60 


1.03 


0.94 


1.89 


8.60 


0.72 




16M 


1.22 


1.26 


1.21 


1.54 


3.67 


1.02 


1.02 


2.05 


7.03 


0.57 
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Table 7.5: Performance of riSelect on randomly generated inputs. 



Sequence Size Time [sec] Comparisons [n] £ avg ]V rn d 



random 



twofaced 



Tl 




max 


min 


aver 


max 


min 


77 




50K 


0.01 


0.06 


0.01 


3.10 


4.32 


1.88 


3.10 


0.40 


100K 


0.03 


0.06 


0.01 


2.61 


4.20 


1.77 


2.61 


0.25 


500K 


0.10 


0.11 


0.05 


2.90 


4.23 


1.69 


2.90 


0.20 


1M 


0.18 


0.22 


0.11 


2.81 


3.64 


1.84 


2.81 


0.35 


2M 


0.34 


0.44 


0.22 


2.60 


3.57 


1.83 


2.60 


0.30 


4M 


0.77 


1.38 


0.44 


2.88 


4.81 


1.83 


2.88 


0.55 


8M 


1.38 


1.70 


1.05 


2.60 


3.48 


1.80 


2.60 


0.45 


16M 


3.00 


4.01 


1.75 


2.99 


4.49 


1.73 


2.99 


0.45 


50K 


0.02 


0.06 


0.01 


2.73 


3.22 


2.68 


2.73 


0.00 


100K 


0.03 


0.06 


0.01 


2.72 


2.88 


2.68 


2.72 


0.00 


500K 


0.11 


0.17 


0.06 


2.74 


2.88 


2.68 


2.74 


0.40 


1M 


0.20 


0.22 


0.16 


2.72 


2.85 


2.68 


2.72 


0.55 


2M 


0.39 


0.44 


0.38 


2.71 


2.99 


2.68 


2.71 


0.75 


AM 


n 7Q 


u .00 


n 7fi 


9 73 


9 8^ 


9 fiS 


9 73 




8M 


1.62 


1.98 


1.54 


2.72 


2.88 


2.68 


2.72 


1.00 


16M 


3.13 


3.19 


3.07 


2.72 


2.85 


2.68 


2.72 


0.95 


50K 


0.03 


0.11 


0.01 


7.74 


8.45 


7.00 


7.74 


1.20 


100K 


0.05 


0.11 


0.01 


7.57 


8.35 


6.79 


7.57 


1.20 


500K 


0.17 


0.22 


0.11 


7.60 


9.25 


6.60 


7.60 


1.25 


1M 


0.35 


0.39 


0.27 


7.64 


8.61 


7.02 


7.64 


1.35 


2M 


0.70 


0.77 


0.55 


7.69 


8.55 


6.72 


7.69 


1.30 


4M 


1.39 


1.65 


1.21 


7.70 


8.98 


6.89 


7.70 


1.30 


8M 


2.80 


3.30 


2.47 


7.73 


9.12 


6.97 


7.73 


1.30 


16M 


5.39 


6.15 


4.83 


7.49 


8.34 


6.79 


7.49 


1.40 
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Table 7.6: Performance of riSelect on deterministic inputs. 



Sequence 



sorted 



rotated 



organpipe 



m3killer 



Size 


Time [sec] 


Comparisons 


-^avg 


^rnd 


n 


avg 


max 


min 


[n] 


N 




50K 


0.01 


0.01 


0.01 


1.00 


1.00 


0.00 


100K 


0.01 


0.06 


0.01 


1.00 


1.00 


0.00 


500K 


0.01 


0.01 


0.01 


1.00 


1.00 


0.00 


1M 


0.05 


0.11 


0.01 


1.00 


1.00 


0.00 


2M 


0.08 


0.11 


0.05 


1.00 


1.00 


0.00 


4M 


0.15 


0.17 


0.11 


1.00 


1.00 


0.00 


8M 


0.29 


0.33 


0.27 


1.00 


1.00 


0.00 


16M 


0.56 


0.60 


0.55 


1.00 


1.00 


0.00 


50K 


0.01 


0.06 


0.01 


3.99 


3.98 


2.00 


100K 


0.02 


0.06 


0.01 


3.97 


3.97 


2.00 


500K 


0.11 


0.16 


0.06 


4.01 


4.01 


3.00 


1M 


0.13 


0.17 


0.11 


3.96 


3.96 


2.00 


2M 


0.28 


0.33 


0.27 


3.99 


3.99 


1.00 


4M 


0.56 


0.60 


0.55 


4.00 


4.00 


3.00 


8M 


1.10 


1.10 


1.10 


3.97 


3.97 


2.00 


16M 


2.19 


2.20 


2.15 


3.96 


3.96 


2.00 


50K 


0.01 


0.06 


0.01 


9.43 


9.43 


4.00 


100K 


0.06 


0.11 


0.01 


9.73 


9.73 


4.00 


500K 


0.16 


0.17 


0.16 


8.31 


8.31 


4.00 


1M 


0.35 


0.38 


0.33 


8.53 


8.53 


5.00 


2M 


0.77 


0.77 


0.77 


9.73 


9.73 


5.00 


4M 


1.87 


1.87 


1.87 


12.10 


12.10 


5.00 


8M 


2.25 


2.26 


2.25 


7.34 


7.34 


3.00 


16M 


5.07 


5.11 


5.05 


7.88 


7.88 


3.00 


50K 


0.02 


0.06 


0.01 


7.57 


7.57 


2.00 


100K 


0.03 


0.06 


0.01 


11.52 


11.52 


2.00 


500K 


0.16 


0.17 


0.16 


7.64 


7.64 


1.00 


1M 


0.33 


0.33 


0.33 


8.00 


8.00 


1.00 


2M 


0.68 


0.71 


0.66 


8.26 


8.26 


1.00 


4M 


1.13 


1.16 


1.09 


7.15 


7.15 


1.00 


8M 


2.86 


2.86 


2.85 


9.19 


9.19 


2.00 


16M 


4.72 


4.73 


4.72 


7.43 


7.43 


2.00 
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